We consider the problem of learning a stochastic differential equation of the form $dx_t = f(x_t)dt + \sigma(x_t)dw_t$ from one sample trajectory. This problem is more challenging than learning a deterministic dynamical system because one sample trajectory only provides indirect information on the unknown functions $f$, $\sigma$, and the stochastic process $dw_t$, which represent the drift, the diffusion, and the stochastic forcing terms, respectively. We propose a simple kernel-based solution to this problem that can be decomposed as follows: (1) represent the time-increment map $x_t \rightarrow x_{t+dt}$ as a computational graph in which $f$, $\sigma$, and $dw_t$ appear as unknown functions and random variables; (2) complete the graph (approximate the unknown functions and random variables) via maximum a posteriori estimation (given the data) with Gaussian process (GP) priors on the unknown functions; (3) learn the covariance functions (kernels) of the GP priors from data with randomized cross-validation. Numerical experiments illustrate the efficacy, robustness, and scope of our method.
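As a minimal illustration of step (2), the sketch below fits the drift of a scalar SDE by kernel-ridge/GP-MAP estimation and recovers the diffusion from quadratic variation. The kernel, lengthscale, and Euler-Maruyama discretization are illustrative assumptions, not the paper's full computational-graph completion or its learned kernels.

```python
import numpy as np

def rbf(a, b, ell=0.5):
    """Squared-exponential kernel matrix between 1-D point sets a and b."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / ell) ** 2)

def fit_drift_map(x, dt, ell=0.5, lam=25.0):
    """Kernel-ridge / GP-MAP estimate of the drift f from one trajectory x.

    Under an Euler-Maruyama discretization, (x[t+1] - x[t]) / dt is f(x[t])
    plus noise of variance sigma^2 / dt, which the ridge level `lam` reflects.
    """
    xs, ys = x[:-1], (x[1:] - x[:-1]) / dt
    K = rbf(xs, xs, ell)
    alpha = np.linalg.solve(K + lam * np.eye(len(xs)), ys)
    return lambda z: rbf(np.atleast_1d(z), xs, ell) @ alpha

# demo on a synthetic Ornstein-Uhlenbeck path: dx_t = -x_t dt + 0.5 dw_t
rng = np.random.default_rng(0)
dt, n = 1e-2, 2000
x = np.zeros(n)
for t in range(n - 1):
    x[t + 1] = x[t] - x[t] * dt + 0.5 * np.sqrt(dt) * rng.standard_normal()

f_hat = fit_drift_map(x, dt)
print(f_hat(1.0))                                    # true drift f(1) = -1
print(np.sqrt(np.mean((x[1:] - x[:-1]) ** 2) / dt))  # quadratic variation ~ 0.5
```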
There are essentially three kinds of approaches to uncertainty quantification (UQ): (A) robust optimization, (B) Bayesian, and (C) decision-theoretic. Although (A) is robust, it is unfavorable with respect to accuracy and data assimilation. (B) requires a prior, is generally brittle, and posterior estimation can be slow. Although (C) leads to the identification of an optimal prior, its approximation suffers from the curse of dimensionality, and its notion of risk is one of average relative to the distribution of the data. We introduce a fourth kind, which is a hybrid between (A), (B), (C), and hypothesis testing. It can be summarized as follows: after observing a sample $x$, (1) define a likelihood region through the relative likelihood, and (2) play a minmax game in that region to define optimal estimators and their risk. The resulting method has several desirable properties: (a) an optimal prior is identified after measuring the data, and the notion of risk is a posteriori; (b) the determination of the optimal estimate and its risk can be reduced to computing the minimum enclosing ball of the image of the likelihood region under the quantity-of-interest map (which is fast and not subject to the curse of dimensionality). The method is characterized by a parameter in $[0,1]$ acting as an assumed lower bound on the rarity of the observed data (its relative likelihood). When that parameter is close to $1$, the method produces a posterior distribution concentrated around a maximum likelihood estimate, with low-confidence UQ estimates. When that parameter is close to $0$, the method produces a maximal-risk posterior distribution with high-confidence UQ estimates. In addition to navigating the accuracy-uncertainty tradeoff, the proposed method addresses the brittleness of Bayesian inference by navigating the robustness-accuracy tradeoff associated with data assimilation.
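The hedged sketch below works through steps (1) and (2) for a hypothetical one-dimensional Gaussian-mean problem, where the minimum enclosing ball of a scalar quantity of interest is simply an interval. The example and the grid discretization are our assumptions for illustration only.

```python
import numpy as np

# Hypothetical 1-D Gaussian-mean example: observe x_obs, parameter theta = mean.
x_obs, sigma = 1.3, 1.0
theta = np.linspace(-4, 6, 10001)
loglik = -0.5 * ((x_obs - theta) / sigma) ** 2   # log-likelihood up to a constant
rel_lik = np.exp(loglik - loglik.max())          # relative likelihood in (0, 1]

def minmax_estimate(q, alpha):
    """Minmax estimator over the relative-likelihood region {L/L_max >= alpha}.

    For a scalar quantity of interest q, the minimum enclosing ball of the
    image q(region) is an interval: its midpoint is the optimal estimate and
    its radius is the worst-case (a posteriori) risk.
    """
    region = theta[rel_lik >= alpha]
    img = q(region)
    est = 0.5 * (img.min() + img.max())
    risk = 0.5 * (img.max() - img.min())
    return est, risk

q = lambda t: t ** 2                             # quantity of interest
for alpha in (0.99, 0.5, 0.01):
    est, risk = minmax_estimate(q, alpha)
    print(f"alpha={alpha}: estimate={est:.3f}, risk={risk:.3f}")
# alpha near 1 concentrates around the MLE q(x_obs); alpha near 0 maximizes risk.
```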
Numerous models have tried to effectively embed knowledge graphs in low dimensions. Among the state-of-the-art methods, Graph Neural Network (GNN) models provide structure-aware representations of knowledge graphs. However, they often utilize the information of relations and their interactions with entities inefficiently. Moreover, most state-of-the-art knowledge graph embedding models suffer from scalability issues because they assign high-dimensional embeddings to entities and relations. To address these limitations, we propose a scalable general knowledge graph encoder that adaptively incorporates a powerful tensor decomposition method into the aggregation function of RGCN, a well-known relational GNN model. Specifically, the parameters of a low-rank core projection tensor, used to transform neighborhood entities in the encoder, are shared across relations to benefit from multi-task learning and to incorporate relation information effectively. In addition, we propose a low-rank estimation of the core tensor using CP decomposition to compress the model, which is also applicable, as a regularization method, to other similar linear models. We evaluate our model on knowledge graph completion as a common downstream task. We train our model using a new loss function based on contrastive learning, which relieves the training limitation of the 1-N method on huge graphs. We improve RGCN performance on FB15k-237 by 0.42% with considerably lower embedding dimensionality.
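A hedged sketch of the sharing mechanism follows: each relation's projection matrix is assembled from rank-1 CP factors shared across all relations, so parameters scale with the rank rather than with the number of relations. The names, shapes, and initialization are illustrative assumptions, not the paper's exact parameterization.

```python
import torch
import torch.nn as nn

class CPRelationalProjection(nn.Module):
    """Relation-specific projections from a CP-decomposed core tensor.

    The rank-R CP factors are shared across all relations, so each relation's
    d_in x d_out projection is a mixture of R shared rank-1 components
    (multi-task sharing plus model compression).
    """
    def __init__(self, n_rel, d_in, d_out, rank):
        super().__init__()
        self.A = nn.Parameter(torch.randn(n_rel, rank) * 0.1)  # relation factors
        self.B = nn.Parameter(torch.randn(d_in, rank) * 0.1)   # input factors
        self.C = nn.Parameter(torch.randn(d_out, rank) * 0.1)  # output factors

    def forward(self, h_neighbors, rel_ids):
        # W[r] = sum_k A[r,k] * outer(B[:,k], C[:,k]); contract with features.
        W = torch.einsum("rk,ik,ok->rio", self.A[rel_ids], self.B, self.C)
        return torch.einsum("ni,nio->no", h_neighbors, W)

proj = CPRelationalProjection(n_rel=237, d_in=64, d_out=64, rank=8)
h = torch.randn(32, 64)            # 32 neighbor entity embeddings
r = torch.randint(0, 237, (32,))   # the relation on each incident edge
print(proj(h, r).shape)            # torch.Size([32, 64])
```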
Diffusion Probabilistic Models (DPMs) have recently been employed for image deblurring. DPMs are trained via a stochastic denoising process that maps Gaussian noise to the high-quality image, conditioned on the concatenated blurry input. Despite their high-quality generated samples, image-conditioned Diffusion Probabilistic Models (icDPM) rely on synthetic pairwise training data (in-domain), with potentially unclear robustness towards real-world unseen images (out-of-domain). In this work, we investigate the generalization ability of icDPMs in deblurring, and propose a simple but effective guidance that significantly alleviates artifacts and improves out-of-distribution performance. In particular, we propose to first extract a multiscale domain-generalizable representation from the input image that removes domain-specific information while preserving the underlying image structure. The representation is then added into the feature maps of the conditional diffusion model as extra guidance that helps improve generalization. For benchmarking, we focus on out-of-distribution performance by applying a single-dataset-trained model to three external and diverse test sets. The effectiveness of the proposed formulation is demonstrated by improvements over the standard icDPM, as well as state-of-the-art perceptual quality and competitive distortion metrics compared to existing methods.
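A hedged sketch of the conditioning mechanism as described above: an auxiliary encoder extracts a multiscale representation of the blurry input, which is added into the diffusion backbone's feature maps at matching resolutions. The encoder design (strided convolutions with GroupNorm) and the number of scales are our assumptions, not the paper's exact representation extractor.

```python
import torch
import torch.nn as nn

class GuidedConditioning(nn.Module):
    """Sketch: inject a multiscale representation of the blurry input
    additively into the diffusion model's feature maps."""
    def __init__(self, channels=(64, 128, 256)):
        super().__init__()
        self.encoders = nn.ModuleList(
            nn.Sequential(
                nn.Conv2d(3 if i == 0 else channels[i - 1], c, 3, stride=2, padding=1),
                nn.GroupNorm(8, c),  # normalization helps strip domain statistics
                nn.SiLU(),
            )
            for i, c in enumerate(channels)
        )

    def forward(self, blurry, unet_feats):
        """unet_feats: list of diffusion feature maps, finest-first."""
        guided, h = [], blurry
        for enc, feat in zip(self.encoders, unet_feats):
            h = enc(h)
            guided.append(feat + h)  # additive guidance at each resolution
        return guided

cond = GuidedConditioning()
blurry = torch.randn(1, 3, 128, 128)
feats = [torch.randn(1, 64, 64, 64),
         torch.randn(1, 128, 32, 32),
         torch.randn(1, 256, 16, 16)]
print([g.shape for g in cond(blurry, feats)])
```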
We consider the constrained Linear Inverse Problem (LIP), where a certain atomic norm (like the $\ell_1$ or the nuclear norm) is minimized subject to a quadratic constraint. Typically, such cost functions are non-differentiable, which makes them not amenable to the fast optimization methods existing in practice. We propose two equivalent reformulations of the constrained LIP with improved convex regularity: (i) a smooth convex minimization problem, and (ii) a strongly convex min-max problem. These problems can be solved by applying existing acceleration-based convex optimization methods, which provide better $O\left(1/k^2\right)$ theoretical convergence guarantees. However, to fully exploit the utility of these reformulations, we also provide a novel algorithm, which we call the Fast Linear Inverse Problem Solver (FLIPS), tailored to solve the reformulation of the LIP. We demonstrate the performance of FLIPS on the sparse coding problem arising in image processing tasks. In this setting, we observe that FLIPS consistently outperforms the Chambolle-Pock and C-SALSA algorithms, two of the best methods in the current literature.
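To make the problem setting concrete, here is a hedged baseline that sets up the constrained LIP for sparse coding and solves it with an off-the-shelf convex solver. It illustrates the problem FLIPS targets; it is not FLIPS itself or either of the paper's reformulations, and the problem sizes are placeholders.

```python
import numpy as np
import cvxpy as cp

# Constrained LIP: minimize an atomic norm (here the l1 norm) subject to a
# quadratic constraint ||Ax - b||^2 <= eps.
rng = np.random.default_rng(0)
m, n, k = 64, 256, 8
A = rng.standard_normal((m, n)) / np.sqrt(m)
x_true = np.zeros(n)
x_true[rng.choice(n, k, replace=False)] = rng.standard_normal(k)
b = A @ x_true + 0.01 * rng.standard_normal(m)

x = cp.Variable(n)
prob = cp.Problem(cp.Minimize(cp.norm1(x)),
                  [cp.sum_squares(A @ x - b) <= 0.02])
prob.solve()
print("recovery error:", np.linalg.norm(x.value - x_true))
```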
DeepAngle is a machine learning-based method to determine the contact angles of different phases in tomography images of porous materials. Measurement of angles in 3D needs to be done within the surface perpendicular to the angle planes, and it can become inaccurate when dealing with the discretized space of the image voxels. A computationally intensive solution is to correlate and vectorize all surfaces using an adaptable grid and then measure the angles within the desired planes. In contrast, the present study provides a rapid and low-cost technique powered by deep learning to estimate the interfacial angles directly from images. DeepAngle is tested on both synthetic and realistic images against the direct measurement technique and is found to improve the r-squared by 5 to 16% while lowering the computational cost by a factor of 20. This rapid method is especially applicable to processing large tomography datasets and time-resolved images, which are computationally demanding. The developed code and the dataset are available at an open repository on GitHub (https://www.github.com/ArashRabbani/DeepAngle).
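The sketch below illustrates the direct-from-image idea: a small 3D CNN regresses a contact angle from a voxel patch around an interface. The architecture and patch size are illustrative assumptions, not the released DeepAngle network; see the GitHub repository above for the actual code and dataset.

```python
import torch
import torch.nn as nn

class AngleRegressor(nn.Module):
    """Illustrative 3-D CNN that maps a tomography voxel patch to an angle."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv3d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool3d(2),
            nn.Conv3d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool3d(2),
            nn.Flatten(),
            nn.Linear(32 * 8 * 8 * 8, 64), nn.ReLU(),
            nn.Linear(64, 1),              # predicted contact angle in degrees
        )

    def forward(self, patch):
        return self.net(patch)

model = AngleRegressor()
patch = torch.randn(4, 1, 32, 32, 32)      # four 32^3 tomography patches
print(model(patch).shape)                  # torch.Size([4, 1])
```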
Many existing datasets for lidar place recognition are solely representative of structured urban environments, and have recently been saturated in performance by deep learning based approaches. Natural and unstructured environments present many additional challenges for long-term localisation, but these environments are not represented in currently available datasets. To address this we introduce Wild-Places, a challenging large-scale dataset for lidar place recognition in unstructured, natural environments. Wild-Places contains eight lidar sequences collected with a handheld sensor payload over the course of fourteen months, containing a total of 67K undistorted lidar submaps along with accurate 6DoF ground truth. Our dataset contains multiple revisits both within and between sequences, allowing for both intra-sequence (i.e. loop closure detection) and inter-sequence (i.e. re-localisation) place recognition. We also benchmark several state-of-the-art approaches to demonstrate the challenges that this dataset introduces, particularly in the case of long-term place recognition, where natural environments change over time. Our dataset and code will be available at https://csiro-robotics.github.io/Wild-Places.
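For readers unfamiliar with the evaluation setup, here is a hedged sketch of a standard inter-sequence re-localisation metric: a query is counted as correct if its nearest database descriptor lies within a few metres of the query's ground-truth position. The distance threshold and random descriptors are placeholders, not the official Wild-Places protocol.

```python
import numpy as np

def recall_at_1(query_desc, query_pos, db_desc, db_pos, radius=3.0):
    """Fraction of queries whose nearest database descriptor is a true revisit
    (ground-truth positions within `radius` metres)."""
    hits = 0
    for q, p in zip(query_desc, query_pos):
        j = np.argmin(np.linalg.norm(db_desc - q, axis=1))  # nearest neighbour
        hits += np.linalg.norm(db_pos[j] - p) <= radius
    return hits / len(query_desc)

rng = np.random.default_rng(0)
db_desc = rng.standard_normal((1000, 256))              # database submaps
db_pos = rng.uniform(0, 500, (1000, 3))                 # 6DoF positions (xyz)
q_idx = rng.choice(1000, 100, replace=False)            # revisits of db places
query_desc = db_desc[q_idx] + 0.05 * rng.standard_normal((100, 256))
query_pos = db_pos[q_idx]
print("Recall@1:", recall_at_1(query_desc, query_pos, db_desc, db_pos))
```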
We define a broader family of corruption processes that generalizes previously known diffusion models. To reverse these general diffusions, we propose a new objective called Soft Score Matching that provably learns the score function for any linear corruption process, and yields state-of-the-art results on CelebA. Soft Score Matching incorporates the degradation process in the network and trains the model to predict a clean image that, after corruption, matches the diffused observation. We show that our objective learns the gradient of the likelihood under suitable regularity conditions for the family of corruption processes. We further develop a principled way to select the corruption levels for general diffusion processes and a novel sampling method that we call the Momentum Sampler. We evaluate our framework with corruptions given by Gaussian blur and low-magnitude additive noise. Our method achieves a state-of-the-art FID score of $1.85$ on CelebA-64, outperforming all previous linear diffusion models. We also show significant computational benefits compared to vanilla denoising diffusion.
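The sketch below is a schematic of the training objective as stated above: the model sees the diffused observation and is trained so that corrupting its clean prediction matches that observation. The box-blur corruption, schedule, and loss weighting are our assumptions; the paper's exact parameterization may differ.

```python
import torch
import torch.nn.functional as F

def blur(x, t):
    """Deterministic linear corruption: blur strength grows with t
    (a box-blur stand-in for the paper's Gaussian blur)."""
    k = 2 * int(3 * t) + 1                               # odd kernel size
    kernel = torch.ones(x.shape[1], 1, k, k) / (k * k)
    return F.conv2d(x, kernel, padding=k // 2, groups=x.shape[1])

def soft_score_matching_loss(model, x0, t, sigma_t):
    """Schematic loss: corrupt the model's clean prediction and match it
    against the diffused observation y_t."""
    y_t = blur(x0, t) + sigma_t * torch.randn_like(x0)   # diffused observation
    x0_hat = model(y_t, t)                               # predicted clean image
    return F.mse_loss(blur(x0_hat, t), y_t)

model = lambda y, t: y                                   # placeholder network
x0 = torch.rand(2, 3, 64, 64)
print(soft_score_matching_loss(model, x0, t=1.0, sigma_t=0.05))
```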
As graph data becomes increasingly prevalent, the need for robust inferential graph algorithms in these complex data domains is paramount. In many settings of interest, the situation is further complicated by the presence of adversarial data contamination. The effect of the adversary is frequently to change the data distribution in ways that negatively affect statistical and algorithmic performance. We study this phenomenon in the context of vertex nomination, a semi-supervised information-retrieval task for network data. Here, a common suite of methods relies on spectral graph embeddings, which provide both good algorithmic performance and a flexible setting in which regularization techniques can be implemented to help mitigate the effect of an adversary. Many current regularization methods rely on direct network pruning to effectively remove the adversarial contamination, although such direct pruning often gives rise to complicated dependency structures in the resulting graph. We propose a new pruning method that operates in model space and can address both block-structured contamination and white-noise contamination (in which the distribution of the contamination is unknown). Compared to direct pruning, this model pruning is more amenable to theoretical analysis while also demonstrating superior performance in a number of simulations.
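The hedged sketch below conveys the flavor of pruning in model space: embed the graph spectrally, then flag vertices whose latent positions are outliers relative to the fitted structure, rather than pruning edges in the observed graph. The centroid-based outlier rule is a simple stand-in for the paper's estimator, which prunes relative to a fitted block model.

```python
import numpy as np

def spectral_embed(A, d=2):
    """Adjacency spectral embedding: top-d scaled eigenvectors."""
    vals, vecs = np.linalg.eigh(A)
    idx = np.argsort(np.abs(vals))[::-1][:d]
    return vecs[:, idx] * np.sqrt(np.abs(vals[idx]))

def model_space_prune(A, d=2, z=3.0):
    """Keep vertices whose latent positions lie within z robust standard
    deviations of the embedding's center; flag the rest as contamination."""
    X = spectral_embed(A, d)
    dist = np.linalg.norm(X - np.median(X, axis=0), axis=1)
    scale = np.median(np.abs(dist - np.median(dist))) + 1e-12
    return dist <= np.median(dist) + z * scale

# two-block SBM plus a handful of adversarially dense contaminating vertices
rng = np.random.default_rng(0)
n = 200
P = np.full((n, n), 0.05)
P[:100, :100] = P[100:, 100:] = 0.3
P[-5:, :] = P[:, -5:] = 0.9                # contamination
A = (rng.uniform(size=(n, n)) < P).astype(float)
A = np.triu(A, 1); A = A + A.T
print("kept vertices:", model_space_prune(A).sum(), "of", n)
```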
Crown-of-Thorns Starfish (COTS) outbreaks are a major cause of coral loss on the Great Barrier Reef (GBR), and substantial surveillance and control programs are underway to manage COTS populations to ecologically sustainable levels. In this paper, we present an edge-device-based underwater data collection and curation system for COTS monitoring. In particular, we leverage the power of deep learning-based object detection techniques and propose a resource-efficient COTS detector that performs detection inference on the edge device to assist marine experts with COTS identification during the data collection phase. Preliminary results show that several strategies for improving computational efficiency (e.g., batch processing, frame skipping, reduced model input size) can be combined to run the proposed detection model on edge hardware with low resource consumption and low information loss.
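The sketch below combines two of the efficiency strategies named above, frame skipping and batched inference, in a minimal processing loop. The detector is a dummy stand-in for the deployed COTS model, whose input size would also be reduced for the edge device.

```python
import time
import numpy as np

def run_edge_inference(frames, detect_batch, skip=2, batch_size=4):
    """Process every (skip+1)-th frame and group frames into batches
    before invoking the detector."""
    detections, buffer, kept_ids = [], [], []
    for i, frame in enumerate(frames):
        if i % (skip + 1):                 # frame skipping
            continue
        buffer.append(frame); kept_ids.append(i)
        if len(buffer) == batch_size:      # batched inference
            for fid, dets in zip(kept_ids, detect_batch(np.stack(buffer))):
                detections.append((fid, dets))
            buffer, kept_ids = [], []
    if buffer:                             # flush the final partial batch
        for fid, dets in zip(kept_ids, detect_batch(np.stack(buffer))):
            detections.append((fid, dets))
    return detections

def dummy_detector(batch):
    """Stand-in detector: fixed per-frame delay, empty box list per frame."""
    time.sleep(0.01 * len(batch))
    return [[] for _ in batch]

frames = [np.zeros((320, 320, 3), dtype=np.uint8) for _ in range(30)]
print(len(run_edge_inference(frames, dummy_detector)))  # 10 frames processed
```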